SEQUENCE LISTING 



110> F . HOFFMANN-LA ROCHE AG 

12 0> PROCESS FOR THE MANUFACTURE OF CAROTENOIDS AND 
BIOLOGICALLY USEFUL MATERIALS THEREOF 

13 0> SOD 

<140> 
<141> 

<150> EP99123821.3 
<151> 1999-12-01 

<160> 17 

<170> Patentln Ver. 2.0 

<210> 1 

<211> 3632 

<212> DNA 

<213> Phaffia rhodozyma 
<220> 

<221> 5*UTR 

<222> (922) . . (923) 

<223> EXPERIMENTAL 

<220> 

<221> exon 

<222> (986) . . (1096) 

<220> 

<221> intron 

<222> (1097) . . (1306) 

<220> 

<221> exon 

<222> (1307) . . (1456) 

<220> 

<221> intron 
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<222> (1457) . . (1555) 



<220> 

<221> exon 

<222> (1556) . . (1589) 

<220> 

<221> intron 

<222> (1590) . . (1694) 

<220> 

<221> exon 

<222> (1695) . . (1799) 

<220> 

<221> intron 

<222> (1800) (1920) 

<220> 

<221> exon 

<222> (1921) . . (1982) 

<220> 

<221> intron 

<222> (1983) . . (2076) 

<220> 

<221> exon 

<222> (2077) . . (2140) 

<220> 

<221> intron 

<222> (2141) . . (2246) 

<220> 

< 2 2 1 > exon 

<222> (2247) . . (2272) 

<220> 

<221> intron 

<222> (2273) . . (2390) 

<220> 
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<221> exon 

<222> (2391) . . (2507) 

<220> 

<221> polyA_site 
<222> (2663) . . (2664) 
<223> EXPERIMENTAL 

<400> 1 

tcctgttgat aatctttcta acgccttgta ctttgaccaa ggcgtttgtc cgaaattttg 60 
caaacttagt gttggtcgca tggacggtct tcggatccag aactgacggc tcgccaataa 120 
agtatgacga tggtagaggt gaaggaggga accacaggtt gaccagtctc aaagagtgct 180 
gatgtgcgcg aggatttgtc attaaatggt gttgtatatg ctagagccaa gagaagacat 24 0 
ttggttttgg ttttggtttt gcatttgatg agatgtgtca cgattgaaga cgggaggagg 300 
ctcactaacc caagaagcca ggatcaggag gaatgcctcc cccttttcat caagatcttt 360 
ctcacatcga acatttgaca ttctctttag tatccttcta tccttttctt ccaacttctc 420 
ccattgtatc gactttgctc gacttgctct tcttatctct gagcagagat gggcattcca 480 
atatcgaagg agcgacacaa gaccttggag tttgggtaac agatgaagag gggccgaggt 540 
ggatggggct gtaggaagta gctgatcgat gagttcctgg atgatgatag gcgaaggaac 60 0 
agacatagga tctctgtctc gtcctggaat tactgagtct tgtatccagc gtgttcttgt 660 
ctcgaagaag ccttcaagat cgatgtaaga taagacaggc aatgaggacg gacgaacgaa 72 0 
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cgaacgaaaa gaacagaaga gctggtaagt cagtcagtca gtcagtcagt caatcaaaca 780 
ctggtgtcta gggttatagc tcgacgcgac gcgacgcgtt tgagacgcga tatgcttacg 840 
taatacctgg cgtcatcccc ccagccgagg caagagccga gccgctcgtg aacgacaaaa 900 
ttcaaaaggc tttctccatc ttaagctcat tctcatctaa ccgactcatc tcgttcccat 960 
cattcccatc attctaccgc catcc atg tct gtt cga gca tec etc tct tec 1012 
gtg tct aga cag act ttc gtc get cct get get ttc cag ate agg gca 1060 
aag cat ace ctg cct gag ctt cct tac get tac gat gtaagacttt 1106 
tccgtgttct cctattcgtc gctttcttgg tttttttcgt cttcgccctc tagctcttct 1166 
tcgtcctttc tgtcctgctc tttgttgttg atattcagct cgatagacta acccatctca 1226 
tctcctggac attcttttac tggaaacgta tcttgtcctt ggtttttctt ggctttggtt 1286 
gaaaattcct ctccactcag gec ctg gag ccc tec ate tec aag gag ate atg 1339 
acc ctt cac cac acc aag cac cat cag act tat gtt aac ggc etc aac 13 87 
get gee gag gag age tac teg gee get gtg ggc aag gag gat gtg ctt 1435 
acc cag gtt aag ctt cag tct gtacgtctga ccgttttttt ategacegga 1486 
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acgcctggtg aggagggaga tgaagtttga tgagcgctca tcgtctagca cgttgacccg 154 6 
atcatacag get etc aag ttc aac gga gga gga cac ate aat c 1589 
gtcagtgata ttcttcaaac tettgetgag caagtcaggt caagctgact gtttcgcttt 1649 
gtttctgegg atctatctca tccttgattt ggcatgatga aacag ac tct ctg ttc 1705 
tgg aag aac ttg get ccc tat gga tec gag gag get ace etc tct gaa 1753 
gga cct etc aag aag get ate gag gaa tct ttt ggt tct ttc gag g 1799 
gtccgtccat ctatcttcct attcagttgt gtttggttcc ggtatactca tctgttttgt 1859 
ttccccacaa aataaaaata aaaatcttgt cctctccggg ggttcgactg caegttcata 1919 
g cc ttc aag aag aag ttc aac get gac acc get get gtc caa gga tec 1967 
gga tgg ggc tgg ctt gtatgtatca tatcctttcc atctcaaact cttctcagag 2022 
tctttttcct tgagacttca aactgactat acatgtttct acaacaaaca acag ggc 2079 
ttg aac ccg ctt act aag aag ctg gaa gtc acc acg acc gee aac cag 2127 
gac cct ctg ctt a gtaagttgtt tctacatgat tttctatctc aacgegatet 2180 
gcatgattcg tcactgattc actggattct cttgtttcgt ttttctcggg atgatttcat 2240 
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aaacag ct cac att cct ate ate gga gtt gac gtgegtatet ttcttgaata 2292 
gtcgtagcgt ctgatctcgt tttattgact gacgtgttgc ttctgtccaa atcattaaaa 2352 
aaaatgaaaa caaataatcg attgaccgac gaaaacag ate tgg gag cac get ttc 2408 
tac ctt cag tac aag aac gtc aag cct gac tat etc get get gtt tgg 2456 
tec gtt ate aac tac aag gag gca gag gee cga ttg cag get get etc 2504 
taa gcgggacgaa aagtaacgac atatgaaggg aggatcaaat ategtttett 2557 
cataaacaac tttcgaggca gatgggagag tacgtacaag agaggtttgt atggagaatt 2617 
gagtttgttg aeggttagea ggttatgata tatgtagcta tagtctagtc taaatctgaa 2677 
agaagagaac aagatggttt gtccgaagag attgagagat caagcccggt catctgatgt 2737 
cgaacaaaca tgccctggtc tgccaacagt ttctagcaca ttatgaccat gttcatgtgt 2797 
aaattgggaa atgagecaga aaggtttatt atctaattca ttgattcatg cgactatgga 2857 
tacatatggg atttccagaa caaacagatg caacaaagca eggcatttte caaagatcga 2 917 
gtcctcccac aagtatgegg caaggtttgt tgttaagaga tataaaagca gacgacaaaa 2977 
caaatcgttt atcgaccctg tgcaccaaca ccgtgaccgt ttgacgagtt ggtagagttg 3037 
tagttgttgc tgttcaaagg agctccagac tggaegctte caagcttcaa caacttctcg 3 0 97 
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gcagcgtcgc tgttcgggaa aagaaaaggc 



ttctctactt cttataggct cttagctcaa 



gacgttctca agctgctgct tggaagcttt 



gaagaagatg tcgaaggcta agggcgatga 



taaaagtgtg acgagatgaa aatggagaaa 



aaaggaatgg aggtgtcacc ggccttccac 



acgaagcagt cggtggcctt agcctcgggc 



taggagagga gaaagagaag atgatgagat 



gaagaagata tgcgaagcaa gacatacact 



<210> 2 
<211> 3375 
<212> DNA 

<213> Phaffia rhodozyma 
<220> 

<221> 5'UTR 

<222> (974) . . (975) 

<223> EXPERIMENTAL 

<220> 

<221> exon 

<222> (1040) . . (1063) 



aaaaaggaac agagcgataa gcatatgtga 3157 
gtcaactcac atgtctttgg cggtaccgaa 3217 
tccgagcttg ccagtaggtc cctggttgga 3277 
aaagcatgaa gatattagct atcggcgcga 3337 
agatgattcg caccatcgac gacctcgacc 33 97 
ttcttgtact cctcaacgtt gacgaagatg 3457 
tcgtagatgc tgatgaaaca caataggtag 3 517 
gtcaggatgc ttgcttcact gtagatggag 3577 
ttggaaagag cttgaaccat tgtag 3632 



Page 7 



<220> 

<221> intron 

<222> (1064) . . (1237) 

<220> 

<221> exon 

<222> (1238) . . (1280) 

<220> 

<221> intron 

<222> (1281) . . (1358) 

<220> 

<221> exon 

<222> (1359) . . (1402) 

<220> 

<221> intron 

<222> (1403) . . (1483) 

<220> 

<221> exon 

<222> (1484) . . (1589) 

<220> 

<221> intron 

<222> (1590) . . (1674) 

<220> 

<221> exon 

<222> (1675) . . (1826) 

<220> 

<221> intron 

<222> (1827) . . (1907) 

<220> 

<221> exon 

<222> (1908) . . (1922) 

<220> 

<221> intron 
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<222> (1923) . . (1992) 



<220> 

<221> exon 

<222> (1993) . . (2056) 

<220> 

<221> intron 

<222> (2057) . . (2131) 

<220> 

<221> exon 

<222> (2132) . . (2157) 

<220> 

<221> intron 

<222> (2158) . . (2238) 

<220> 

<221> exon 

<222> (2239) . . (2293) 

<220> 

<221> intron 

<222> (2294) . . (2376) 

<220> 

<221> exon 

<222> (2377) . . (2426) 

<220> 

<221> intron 

<222> (2427) . . (2524) 

<220> 

<221> exon 

<222> (2525) . . (2542) 

<220> 

<221> polyA_site 
<222> (2667) . . (2668) 
<223> EXPERIMENTAL 
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<400> 2 

cttatccttc tgccgctggt ctctgtctgt cgagtgtgtg ggatggtttt ggatatgttc 60 
ctatacgaaa ggtagcgcag agcaaagctg acagtattaa gcaagacaag agcttctttc 12 0 
tgttgacaga tgaaaggacg aactatgaag ctgtccatgc tccccaaacc gattgacaca 180 
ccgccgtcag gcaacgcaga atttctcact gcttcgacgt cacaccaaca tcgatcctcc 240 
atacctaaaa gcagatcgag acacattgtt ggtcgccatg ttggatggat gtacatcaaa 300 
cccacagcat atatcactca catgtgagaa ctccgtagcc tctaccttct tgtctctcaa 360 
tctgaatgtc tcgttgagag gtggaatgaa tgtttacagt ttgagaagac gaaagaaaga 420 
aagagaagag aagagaggaa tacgtacgac gaagttatca tcgtatggga acttttctaa 480 
aaaactgcct atagtagaga cgatctctgg aggaaagctc tgtagtatga tagtgaagag 54 0 
cgagcaagtc tgggcaagtg catccttcgt ctacaagaaa gagaccagga aatgaaggag 600 
agacgagtaa gcaggtacct accgatattg gatcgttctc tctacccagc gatgccttca 66 0 
ccacgcgttc tatctcttct tgggatggca gatacatact taacgagagc aatctgatgt 720 
ataccgaact tcgaacggaa tgatcccaga atcctcttga acccttgaac ccttgaaccc 78 0 
tggaaccaag taccaaccga gcaacacgcc gatacggtcc acaccacaga accacacgcc 84 0 
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ctcgtcatta aaggtgggac gcgccgatgc tggttacgtt cggcccaatc cggaagttac 900 
cggcttggac gtgcctgtaa ccatgccctg acggtatttc gccttcagct aactccatct 960 
catctttttc ctttactacc acaacccacc cttgaacctt cttccccggc ttttttacta 1020 
tatccatcta tcaatcatc atg get cct tac act ctt ccc gac gtaagcttaa 1073 
agtttgagct gtgtgtgctt atctcaatct tggagttgaa ctcaccgttt tttgtttttg 1133 
cttcctggtt tttttategg catccctcct ttttttcccc tcgtggtcgc atatgatttg 1193 
ctcatcaatc ggcgtttccc atgcatcttt gtcatccgtt tcag ctt cct tac get 1249 
tac gat gec ttg gag cct tac ate tct aag g gtgagattct tagtcagact 13 00 
gttgttccgg ttcgacacga tagctaatcg tctctcgttc ctcaatatga acatgeag 1358 
aa ate atg ate ctt cac cac tec aag cac cat cag act tac gtc 1402 
gtaegtaate taaaggtcat ctccgtctac atggccggat caacttgctc atagatcttc 1462 
cttctgttcg gcgctacgta g acc aac etc aac gec get ate cag get ttc 1513 
tec cag acc aat gac ate aag gee cag ate get ctt cag age get etc 1561 
aag ttc aac gga gga gga cac ate aac c gtacgatcat tctccctctt 1609 
ctggcttatc atatgtgttg cttgtcacta acacgcatgc aaccccggga tatctcaccc 1669 
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tgtag ac tec etc ttc tgg aag aac atg get cct gec gac tct get gat 1718 
gee aag etc ace gag gga teg etc aag act gee ate gac aag gac ttt 1766 
gga tec ttc gag gag ttc aag aag aag ttc aac act get act etc ggt 1814 
gtc cag gga tct gtcagtatct cgtttgcttc gacatactct cagctttcct 1866 
teegtaaact gacgaatagt tttteggaca tgtacttgta g gga tgg gga tgg etc 1922 
gtgcgtttga cctttttcca ctttgaacat tagegatagt gatacctaac aactgtgaat 1982 
tggaatatag gga tac aac acc get ace aag cac etc gag ate gee acc 2031 
acc gec aac cag gat ccc ctt ate a gtatgtgact tctctcgtgt 2076 
ggtcaccata agccagttgc tgacacattt cgttcgctgt ctctcgactt cgtag ct 2133 
ttg act ccc ate att ggt ctt gac gttagtaatt ctatctagtg attggagtcg 2187 
agttctgaac ttgecttgat ctcaaacgaa tgaatcaatt tcttttggta g ate tgg 2244 
gag cac get ttc tac etc cag tac aag aat gtc aag cct gat tac ctt g 22 93 
gtaegtaatt etctattegt ttgccccggt ttgatctttg actcactctt caaaatgttt 23 53 
tcgtttgta actttgaaaa acag cc get ttc tgg aac gtc tgc aac ttt get 24 05 
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gag get cag cga agg ttt gat gtgagtacag gcgctacccc tacggaggaa 2456 

gcgaaggtga gctgaccact ttttatcttt ctgatttgga atgaacgatc cgatgatcaa 2516 
acaaacag get get gtc aag get taa tggtcccatt tatctctttg attcgaegge 2572 

gatgaegget ttctcgcatc cgaagaaggc aaggctatga ttactgttat tetgecatgt 2632 

tgcttgcttt getatgetet atgttctttt cttttgcctc tcttcaaagc caaggegtta 2692 

aggaaggece ttcagtctgt tttacatatg cacatataca tgagaacata tcacggactc 2752 

ggcggctggt ggtcctcttg agegtegget tcaagattag tgtccacacg tgaagcgttc 2812 

ggtgccatcc aacctggtag gaatccccat egggegggaa tccaattatc aattggcggt 2872 

eggecagatt egagcteggg tatctcagaa gcgtcaagcg ggegcattte caggecttta 2932 

agaggagcaa atttaatccg cctgggtgtt cagegagaca cgaacagttt gaaacagagt 2992 

ctgcttgtga gttactegge gagatcactg aggactaaac tttctcagct cgtggacgaa 3052 

aagaacgaac caaaeggtet tccctgtatc tcgaccatct ccttctccat ctcttacaac 3112 

accteggatg aactccaagg cttgctttcc aaagttcaaa caaactccgg gttgccatcc 3172 

acctggtttg tctctaacga gecgagggat atecategtt cggaacgttt gaacagactg 3232 

gatggtaggt ggccggtcgc tteggaagee aatcataatg gtgggaatcg agagaaggaa 3292 
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tgattgggcc cagtgtttaa gacttgtgtt tgttggcaga gtacggacgg aaagtaggac 
agacttaatc aaggcgagcc aag 



<210> 3 
<211> 951 
<212> DNA 

<213> Phaffia rhodozyma 
<220> 

<221> exon 
<222> (1) . . (34) 

<220> 

<221> intron 
<222> (35) . . (115) 

<220> 

<221> exon 

<222> (117) . . (159) 

<220> 

<221> intron 
<222> (160) . . (242) 

<220> 

<221> exon 

<222> (243) . . (363) 

<220> 

<221> intron 
<222> (364) . . (436) 

<220> 

<221> exon 

<222> (437) . . (951) 

<400> 3 
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tec gga age tea gat acc get cga gat cct cga g gtttgtgtgc 44 

tttcgctttg ttcgcatgga tgaagctgtt aacttaaaaa aatcctcgtg tttctctttg 104 

tttcaacata g gt ttc tct ctt aag gtc aag acc tct gag gga aac tgg 153 

gac ttt gtaegtatte ttatcgactg agtcatcaag ctegttateg ctctcttacc 209 

ctcatccttt tgtgtctctg tctacacctc tag gtc gga aac aac act ccc ate 263 

ttt ttc ttg aga gac cca gec aag ttt ccg ate ttc att cac acc cag 311 

aag agg aac ccg cag aca aac tct aaa gac aag gac get ttc tgg gac 359 

tac c gttegtata accttgtcac tccctgcgtg ccgctctgat tcatgttgac 412 
cttgtctttg atataatttt atag ta tec caa aac ccc gag tec gtg cat cag 465 

gtg ctg cac ctg ttc agt gat cga gga acc cct get tct tac cga cac 513 

atg cat ggt tac tct gga cac acc ttc aag atg gtc aac agg aac ggt 561 
gac tgg aat tat gtc cag att cac atg cgc acc gat cag ggt gtc aag 609 
act cac acc aat gaa gag get teg aaa etc gac gec tec aat ccc gat 657 
tea aac gga gac gac ttg ttc gac gca ate aag aat gga gac ttc cct 705 
age tgg acg gtt cag gta cag gta atg tct cct gag cag gee cag aag 753 
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ttc aga tac aac att ctg gat etc acc aag gtc tgg tec cac aag gag 801 

ttc cca ctt agg acg att gga aag ttc act ttg aac cga aac gtg gat 849 

aac tat ttc gca gag gtt gaa cag etc gec ttt get cct tec cat ctg 897 

cct cct gga ate gag ccc teg aac gat ccc gtc ctt cag get cga eta 945 



ttc tec 



<210> 4 
<211> 669 
<212> DNA 

<213> Phaffia rhodozyma 

<220> 

<221> CDS 

<222> (1) . . (669) 



951 



48 



<400> 4 

atg tct gtt cga gca tec etc tct tec gtg tct aga cag act ttc gtc 

Met Ser Val Arg Ala Ser Leu Ser Ser Val Ser Arg Gin Thr Phe Val 
15 10 15 

get cct get get ttc cag ate agg gca aag cat acc ctg cct gag ctt 96 

Ala Pro Ala Ala Phe Gin He Arg Ala Lys His Thr Leu Pro Glu Leu 
20 25 30 

cct tac get tac gat gec ctg gag ccc tec ate tec aag gag ate atg 144 

Pro Tyr Ala Tyr Asp Ala Leu Glu Pro Ser He Ser Lys Glu He Met 
35 40 45 

acc ctt cac cac acc aag cac cat cag act tat gtt aac ggc etc aac 192 
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Thr Leu His His 
50 

get gec gag gag 

Ala Ala Glu Glu 
65 

acc cag gtt aag 
Thr Gin Val Lys 

ate aat cac tct 

lie Asn His Ser 
100 

gag get acc etc 

Glu Ala Thr Leu 
115 

ttt ggt tct ttc 

Phe Gly Ser Phe 
130 

get gtc caa gga 

Ala Val Gin Gly 
145 

aag aag ctg gaa 
Lys Lys Leu Glu 

cac att cct ate 

His lie Pro lie 
180 



Thr Lys His His Gin 
55 

age tac teg gee get 

Ser Tyr Ser Ala Ala 
70 

ctt cag tct get etc 

Leu Gin Ser Ala Leu 
85 

ctg ttc tgg aag aac 

Leu Phe Trp Lys Asn 

105 

tct gaa gga cct etc 

Ser Glu Gly Pro Leu 
120 

gag gee ttc aag aag 

Glu Ala Phe Lys Lys 
135 

tec gga tgg ggc tgg 

Ser Gly Trp Gly Trp 
150 

gtc acc acg acc gec 

Val Thr Thr Thr Ala 
165 

ate gga gtt gac ate 

lie Gly Val Asp lie 

185 



Thr Tyr Val Asn 
60 

gtg ggc aag gag 

Val Gly Lys Glu 
75 

aag ttc aac gga 

Lys Phe Asn Gly 
90 

ttg get ccc tat 

Leu Ala Pro Tyr 

aag aag get ate 

Lys Lys Ala lie 
125 

aag ttc aac get 

Lys Phe Asn Ala 
140 

ctt ggc ttg aac 

Leu Gly Leu Asn 
155 

aac cag gac cct 

Asn Gin Asp Pro 
170 

tgg gag cac get 
Trp Glu His Ala 



Gly Leu Asn 

gat gtg ctt 240 

Asp Val Leu 
80 

gga gga cac 288 

Gly Gly His 
95 

gga tec gag 33 6 

Gly Ser Glu 
110 

gag gaa tct 3 84 
Glu Glu Ser 

gac acc get 432 
Asp Thr Ala 

ccg ctt act 480 

Pro Leu Thr 
160 

ctg ctt act 528 

Leu Leu Thr 
175 

ttc tac ctt 576 

Phe Tyr Leu 
190 
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cag tac aag aac gtc aag cct gac tat etc get get gtt tgg tec gtt 624 

Gin Tyr Lys Asn Val Lys Pro Asp Tyr Leu Ala Ala Val Trp Ser Val 
195 200 205 



ate aac tac aag gag gca gag gec cga ttg cag get get etc taa 

lie Asn Tyr Lys Glu Ala Glu Ala Arg Leu Gin Ala Ala Leu 
210 215 220 



<210> 5 
<211> 222 
<212> PRT 

<213> Phaffia rhodozyma 
<400> 5 

Met Ser Val Arg Ala Ser Leu Ser Ser Val Ser Arg Gin Thr Phe Val 
15 10 15 

Ala Pro Ala Ala Phe Gin He Arg Ala Lys His Thr Leu Pro Glu Leu 
20 25 30 

Pro Tyr Ala Tyr Asp Ala Leu Glu Pro Ser He Ser Lys Glu He Met 
35 40 45 

Thr Leu His His Thr Lys His His Gin Thr Tyr Val Asn Gly Leu Asn 
50 55 60 

Ala Ala Glu Glu Ser Tyr Ser Ala Ala Val Gly Lys Glu Asp Val Leu 

65 70 75 80 

Thr Gin Val Lys Leu Gin Ser Ala Leu Lys Phe Asn Gly Gly Gly His 

85 90 95 

He Asn His Ser Leu Phe Trp Lys Asn Leu Ala Pro Tyr Gly Ser Glu 
100 105 HO 

Glu Ala Thr Leu Ser Glu Gly Pro Leu Lys Lys Ala He Glu Glu Ser 
115 120 125 

Phe Gly Ser Phe Glu Ala Phe Lys Lys Lys Phe Asn Ala Asp Thr Ala 



669 
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130 

Ala Val Gin Gly 
145 

Lys Lys Leu Glu 



His lie Pro lie 
180 

Gin Tyr Lys Asn 
195 

lie Asn Tyr Lys 
210 



135 

Ser Gly Trp Gly 
150 

Val Thr Thr Thr 
165 

lie Gly Val Asp 



Val Lys Pro Asp 
200 

Glu Ala Glu Ala 
215 



140 

Trp Leu Gly Leu 
155 

Ala Asn Gin Asp 
170 

lie Trp Glu His 
185 

Tyr Leu Ala Ala 



Arg Leu Gin Ala 
220 



Asn Pro Leu Thr 
160 

Pro Leu Leu Thr 
175 

Ala Phe Tyr Leu 
190 

Val Trp Ser Val 
205 

Ala Leu 



<210> 6 
<211> 597 
<212> DNA 

<213> Phaffia rhodozyma 

<220> 

<221> CDS 

<222> (1) . . (597) 

<400> 6 

atg get cct tac act ctt ccc gac ctt cct tac get tac gat gec ttg 48 

Met Ala Pro Tyr Thr Leu Pro Asp Leu Pro Tyr Ala Tyr Asp Ala Leu 
15 10 15 

gag cct tac ate tct aag gaa ate atg ate ctt cac cac tec aag cac 96 

Glu Pro Tyr lie Ser Lys Glu lie Met lie Leu His His Ser Lys His 
20 25 30 

cat cag act tac gtc acc aac etc aac gee get ate cag get ttc tec 144 

His Gin Thr Tyr Val Thr Asn Leu Asn Ala Ala He Gin Ala Phe Ser 
35 40 45 
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cag acc aat gac ate aag gec cag ate get ctt cag age get etc aag 192 

Gin Thr Asn Asp lie Lys Ala Gin lie Ala Leu Gin Ser Ala Leu Lys 
50 55 60 

ttc aac gga gga gga cac ate aac cac tec etc ttc tgg aag aac atg 24 0 

Phe Asn Gly Gly Gly His lie Asn His Ser Leu Phe Trp Lys Asn Met 
65 70 75 80 

get cct gee gac tct get gat gee aag etc acc gag gga teg etc aag 288 

Ala Pro Ala Asp Ser Ala Asp Ala Lys Leu Thr Glu Gly Ser Leu Lys 

85 90 95 

act gee ate gac aag gac ttt gga tec ttc gag gag ttc aag aag aag 336 

Thr Ala He Asp Lys Asp Phe Gly Ser Phe Glu Glu Phe Lys Lys Lys 
100 105 110 

ttc aac act get act etc ggt gtc cag gga tct gga tgg gga tgg etc 3 84 

Phe Asn Thr Ala Thr Leu Gly Val Gin Gly Ser Gly Trp Gly Trp Leu 
115 120 125 

gga tac aac acc get acc aag cac etc gag ate gec acc acc gee aac 432 

Gly Tyr Asn Thr Ala Thr Lys His Leu Glu He Ala Thr Thr Ala Asn 
130 135 140 

cag gat ccc ctt ate act ttg act ccc ate att ggt ctt gac ate tgg 480 

Gin Asp Pro Leu He Thr Leu Thr Pro He He Gly Leu Asp He Trp 
145 150 155 160 

gag cac get ttc tac etc cag tac aag aat gtc aag cct gat tac ctt 528 

Glu His Ala Phe Tyr Leu Gin Tyr Lys Asn Val Lys Pro Asp Tyr Leu 

165 170 175 

gee get ttc tgg aac gtc tgc aac ttt get gag get cag cga agg ttt 576 
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Ala Ala Phe Trp Asn Val Cys Asn Phe Ala Glu Ala Gin Arg Arg Phe 
180 185 190 



gat get get gtc aag get taa 



Asp Ala Ala Val Lys Ala 
195 



597 



<210> 7 
<211> 198 
<212> PRT 

<213> Phaffia rhodozyma 



<400> 7 

Met Ala Pro Tyr 
1 

Glu Pro Tyr lie 
20 

His Gin Thr Tyr 
35 

Gin Thr Asn Asp 
50 

Phe Asn Gly Gly 
65 

Ala Pro Ala Asp 



Thr Ala lie Asp 
100 

Phe Asn Thr Ala 
115 

Gly Tyr Asn Thr 
130 

Gin Asp Pro Leu 



Thr Leu Pro Asp 
5 

Ser Lys Glu lie 



Val Thr Asn Leu 
40 

He Lys Ala Gin 
55 

Gly His He Asn 
70 

Ser Ala Asp Ala 
85 

Lys Asp Phe Gly 



Thr Leu Gly Val 
120 

Ala Thr Lys His 
135 

He Thr Leu Thr 



Leu Pro Tyr Ala 
10 

Met He Leu His 
25 

Asn Ala Ala He 



He Ala Leu Gin 
60 

His Ser Leu Phe 
75 

Lys Leu Thr Glu 
90 

Ser Phe Glu Glu 
105 

Gin Gly Ser Gly 



Leu Glu He Ala 
140 

Pro He He Gly 



Tyr Asp Ala Leu 
15 

His Ser Lys His 
30 

Gin Ala Phe Ser 
45 

Ser Ala Leu Lys 



Trp Lys Asn Met 
80 

Gly Ser Leu Lys 
95 

Phe Lys Lys Lys 
110 

Trp Gly Trp Leu 
125 

Thr Thr Ala Asn 



Leu Asp He Trp 
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145 

Glu His Ala Phe 

Ala Ala Phe Trp 
180 

Asp Ala Ala Val 
195 



150 

Tyr Leu Gin Tyr 
165 

Asn Val Cys Asn 
Lys Ala 



155 

Lys Asn Val Lys 
170 

Phe Ala Glu Ala 
185 



160 

Pro Asp Tyr Leu 
175 

Gin Arg Arg Phe 
190 



<210> 8 
<211> 714 
<212> DNA 

<213> Phaffia rhodozyma 

<220> 

<221> CDS 

<222> (1) . . (714) 

<400> 8 

tec gga age tea gat acc 

Ser Gly Ser Ser Asp Thr 
1 5 

gtc aag acc tct gag gga 

Val Lys Thr Ser Glu Gly 
20 

ate ttt ttc ttg aga gac 

lie Phe Phe Leu Arg Asp 
35 

cag aag agg aac ccg cag 

Gin Lys Arg Asn Pro Gin 
50 



get cga gat cct cga ggt 

Ala Arg Asp Pro Arg Gly 
10 

aac tgg gac ttt gtc gga 

Asn Trp Asp Phe Val Gly 
25 

cca gee aag ttt ccg ate 

Pro Ala Lys Phe Pro lie 
40 

aca aac tct aaa gac aag 

Thr Asn Ser Lys Asp Lys 
55 60 



ttc tct ctt aag 48 

Phe Ser Leu Lys 
15 

aac aac act ccc 96 

Asn Asn Thr Pro 
30 

ttc att cac acc 144 

Phe lie His Thr 
45 

gac get ttc tgg 192 
Asp Ala Phe Trp 
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gac tac eta tec 

Asp Tyr Leu Ser 
65 

ttc agt gat cga 
Phe Ser Asp Arg 

tct gga cac acc 

Ser Gly His Thr 
100 

gtc cag att cac 

Val Gin lie His 
115 

gaa gag get teg 

Glu Glu Ala Ser 
130 

gac ttg ttc gac 

Asp Leu Phe Asp 
145 

cag gta cag gta 
Gin Val Gin Val 

att ctg gat etc 

lie Leu Asp Leu 
180 

acg att gga aag 
Thr lie Gly Lys 



caa aac ccc gag tec 

Gin Asn Pro Glu Ser 
70 

gga acc cct get tct 

Gly Thr Pro Ala Ser 
85 

ttc aag atg gtc aac 

Phe Lys Met Val Asn 

105 

atg cgc acc gat cag 

Met Arg Thr Asp Gin 
120 

aaa etc gac gee tec 

Lys Leu Asp Ala Ser 
135 

gca ate aag aat gga 

Ala lie Lys Asn Gly 
150 

atg tct cct gag cag 

Met Ser Pro Glu Gin 
165 

acc aag gtc tgg tec 

Thr Lys Val Trp Ser 

185 

ttc act ttg aac cga 
Phe Thr Leu Asn Arg 



gtg cat cag gtg 

Val His Gin Val 
75 

tac cga cac atg 

Tyr Arg His Met 
90 

agg aac ggt gac 
Arg Asn Gly Asp 

ggt gtc aag act 

Gly Val Lys Thr 
125 

aat ccc gat tea 

Asn Pro Asp Ser 
140 

gac ttc cct age 

Asp Phe Pro Ser 
155 

gec cag aag ttc 

Ala Gin Lys Phe 
170 

cac aag gag ttc 

His Lys Glu Phe 

aac gtg gat aac 
Asn Val Asp Asn 



ctg cac ctg 240 

Leu His Leu 
80 

cat ggt tac 288 

His Gly Tyr 
95 

tgg aat tat 336 

Trp Asn Tyr 
110 

cac acc aat 384 
His Thr Asn 

aac gga gac 43 2 
Asn Gly Asp 

tgg acg gtt 48 0 

Trp Thr Val 
160 

aga tac aac 528 

Arg Tyr Asn 
175 

cca ctt agg 576 

Pro Leu Arg 
190 

tat ttc gca 624 

Tyr Phe Ala 
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195 200 205 

gag gtt gaa cag etc gec ttt get cct tec cat ctg cct cct gga ate 672 

Glu Val Glu Gin Leu Ala Phe Ala Pro Ser His Leu Pro Pro Gly He 
210 215 220 

gag ccc teg aac gat ccc gtc ctt cag get cga eta ttc tec 714 

Glu Pro Ser Asn Asp Pro Val Leu Gin Ala Arg Leu Phe Ser 
225 230 235 



<210> 9 
<211> 238 
<212> PRT 

<213> Phaffia rhodozyma 
<400> 9 

Ser Gly Ser Ser Asp Thr Ala Arg Asp Pro Arg Gly Phe Ser Leu Lys 
15 10 15 

Val Lys Thr Ser Glu Gly Asn Trp Asp Phe Val Gly Asn Asn Thr Pro 
20 25 30 

He Phe Phe Leu Arg Asp Pro Ala Lys Phe Pro He Phe He His Thr 
35 40 45 

Gin Lys Arg Asn Pro Gin Thr Asn Ser Lys Asp Lys Asp Ala Phe Trp 
50 55 60 

Asp Tyr Leu Ser Gin Asn Pro Glu Ser Val His Gin Val Leu His Leu 
65 70 75 80 

Phe Ser Asp Arg Gly Thr Pro Ala Ser Tyr Arg His Met His Gly Tyr 

85 90 95 

Ser Gly His Thr Phe Lys Met Val Asn Arg Asn Gly Asp Trp Asn Tyr 
100 105 110 

Val Gin He His Met Arg Thr Asp Gin Gly Val Lys Thr His Thr Asn 
115 120 125 
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Glu Glu Ala Ser 
130 

Asp Leu Phe Asp 
145 

Gin Val Gin Val 



lie Leu Asp Leu 
180 

Thr lie Gly Lys 
195 

Glu Val Glu Gin 
210 

Glu Pro Ser Asn 
225 



Lys ' Leu Asp Ala 
135 

Ala lie Lys Asn 
150 

Met Ser Pro Glu 
165 

Thr Lys Val Trp 



Phe Thr Leu Asn 
200 

Leu Ala Phe Ala 
215 

Asp Pro Val Leu 
230 



Ser Asn Pro Asp 
140 

Gly Asp Phe Pro 
155 

Gin Ala Gin Lys 
170 

Ser His Lys Glu 
185 

Arg Asn Val Asp 



Pro Ser His Leu 
220 

Gin Ala Arg Leu 
235 



Ser Asn Gly Asp 



Ser Trp Thr Val 
160 

Phe Arg Tyr Asn 
175 

Phe Pro Leu Arg 
190 

Asn Tyr Phe Ala 
205 

Pro Pro Gly lie 



Phe Ser 



<210> 10 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Sodl (sense 
primer for cloning of SOD genes) 

<400> 10 

aarcaycayc aracntaygt naa 



<210> 11 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Sod4 (antisense 
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primer for cloning of SOD genes) 



<400> 11 

gcccanccng anccytgnac ncc 



<210> 12 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Sodl4 (sense 
primer for the construction of S0D1 -disrupting 
plasmid) 

<400> 12 

ggtacctccg atgataggaa tgtgag 



<210> 13 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : SodlB 
(antisense primer for the construction of 
SODl-disrupting plasmid) 

<400> 13 

gaattcagtt caacggagga ggacac 



<210> 14 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence :Sod4 7 (sense 
primer for the construction of S0D2 -disrupting 
plasmid) 

<400> 14 

gaattcggag gaggacacat caaccg 



<210> 15 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Sod4 8 
(antisense primer for the construction of 
S0D2 -disrupting plasmid) 

<400> 15 

ggtacctgta ctggaggtag aaagcg 



<210> 16 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Sod2 (sense 
primer for cloning of CAT gene) 

<400> 16 

mgnttytcna cngtnggngg nga 



<210> 17 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence : Cat 5 (antisense 
primer for cloning of CAT gene) 

<400> 17 

ckrtgnckyt gngtrtcngg rta 
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